A target approximation intonation model for yorùbá TTS

نویسندگان

  • Daniel R. van Niekerk
  • Etienne Barnard
چکیده

A complete intonation model based on quantitative target approximation is described for Yorùbá text-to-speech (TTS) synthesis. This model is evaluated analytically and perceptually and compared to a fundamental frequency (F0) model using the standard HTS implementation. Analytical results suggest that the proposed approach more efficiently models F0 contours given typical data constraints in under-resourced environments and perceptual results comparing the proposed model with HTS are encouraging.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Maximum-likelihood dynamic intonation model for concatenative text-to-speech system

In this work we present a Maximum Likelihood (ML) joint pitch curve modeling, inspired by HMM TTS synthesis concept. This model provides an optimal solution for the coarse target intonation curve (3 points per syllable) and incorporates both static and dynamic pitch values for better utterance intonation modeling. The coarse intonation curve may be optionally combined with the original pitch ex...

متن کامل

Design Issues in Automatic Grapheme-to-Phoneme Conversion for Standard Yorùbá

Grapheme-to-Phoneme (G2P) conversion is an important problem in Human Language Processing development, particularly Textto-Speech (TTS). Its primary goal is to accurately compute the pronunciation of words in the input texts. This work examines design issues with respect to components of the automatic G2P for standard Yorùbá (SY). The automatic process includes: (i) Tokenisation of Input, (ii) ...

متن کامل

F0 stylization and intonation modelling for Standard Yorùbá Text-to-speech application

This technical report documents experiment into stylization of the f0 curve on Standard Yorùbá (SY ) syllables as well as a technique for intonation modelling. A number of interpolation polynomials were evaluated using root mean square error and mean opinion score techniques. The stylisation experiment resulted in the selection of a 3 degree polynomial for modelling the f0 curves on Yorùbá syll...

متن کامل

Inventory of intonation contours for text-to-speech synthesis

This paper presents an intonation model which determines intonation contours over intonation phrases. The model is described by four elements: communicative type of an intonation phrase; number of accent groups in it; position of the nuclear accent group in it; and set of target intonation points. Individualization of the model is based on semiautomatic analysis of speaker database. The model w...

متن کامل

Generating fundamental frequency contours for speech synthesis in yorùbá

We present methods for modelling and synthesising fundamental frequency (F0) contours suitable for application in textto-speech (TTS) synthesis of Yorùbá (an African tone language). These methods are discussed and compared with a baseline approach using the HMM-based speech synthesis system HTS. Evaluation is done by comparing ten-fold cross validation squared errors on a small corpus of four s...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014